RAW SEQUENCE LISTING DATE: 10/25/2001
PATENT APPLICATION: US/09/940,166 TIME: 10:19:05

Input set : N:\Crf3\RULE60\09940166.txt
Output set: N:\CRF3\10252001\l940166.raw

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Blank, Gregory S.

6 Narindray, Daljit S.

7 Zapata, Gerardo A.

9 (ii) TITLE OF INVENTION: Protein Recovery 

11 (iii) NUMBER OF SEQUENCES: 7 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: Genentech, Inc

15 (B) STREET: 1 DNA Way

16 (C) CITY: South San Francisco

17 (D) STATE: California

18 (E) COUNTRY: USA 

19 (F) ZIP: 94080 

21 (v) COMPUTER READABLE FORM:

22 (A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk
23 (B) COMPUTER: IBM PC compatible

24 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

25 (D) SOFTWARE: WinPatin (Genentech) 

27 (vi) CURRENT APPLICATION DATA: 

28 (A) APPLICATION NUMBER: US/09/940,166 

29 (B) FILING DATE: 27-Aug-2001

30 (C) CLASSIFICATION: 

32 (vii) PRIOR APPLICATION DATA: 

33 (A) APPLICATION NUMBER: 09/097,309 

34 (B) FILING DATE: 13-JUN-1997 
36 (viii) ATTORNEY/AGENT INFORMATION: 
37 (A) NAME: Schwartz, Timothy R.
38 (B) REGISTRATION NUMBER: 32171
39 (C) REFERENCE/DOCKET NUMBER: P1105R1 

41 (ix) TELECOMMUNICATION INFORMATION: 

42 (A) TELEPHONE: 650/225-7467 

43 (B) TELEFAX: 650/952-9881 

44 (2) INFORMATION FOR SEQ ID NO: 1: 

46 (i) SEQUENCE CHARACTERISTICS: 

47 (A) LENGTH: 241 amino acids 
48 (B) TYPE: Amino Acid
49 (D) TOPOLOGY: Linear 

51 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

53 Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
54 1 5 10 15

56 Gly Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Tyr Thr Phe Thr
57 20 25 30

59 Glu Tyr Thr Met His Trp Met Arg Gln Ala Pro Gly Lys Gly Leu
60 35 40 45

62 Glu Trp Val Ala Gly Ile Asn Pro Lys Asn Gly Gly Thr Ser His
63 50 55 60

65 Asn Gln Arg Phe Met Asp Arg Phe Thr Ile Ser Val Asp Lys Ser
66 65 70 75

68 Thr Ser Thr Ala Tyr Met Gln Met Asn Ser Leu Arg Ala Glu Asp
69 80 85 90

71 Thr Ala Val Tyr Tyr Cys Ala Arg Trp Arg Gly Leu Asn Tyr Gly
72 95 100 105

74 Phe Asp Val Arg Tyr Phe Asp Val Trp Gly Gln Gly Thr Leu Val
75 110 115 120

77 Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu
78 125 130 135
80 Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly
81 140 145 150

83 Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp
84 155 160 165

86 Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val
87 170 175 180

89 Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val
90 185 190 195

92 Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn
93 200 205 210

95 His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys
96 215 220 225

98 Ser Cys Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu
99 230 235 240

101 Leu
102 241
104 (2) INFORMATION FOR SEQ ID NO: 2:

106 (i) SEQUENCE CHARACTERISTICS:
107 (A) LENGTH: 214 amino acids
108 (B) TYPE: Amino Acid
109 (D) TOPOLOGY: Linear

111 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

113 Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
114 1 5 10 15

116 Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Gln Asp Ile Asn
117 20 25 30

119 Asn Tyr Leu Asn Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys
120 35 40 45

122 Leu Leu Ile Tyr Tyr Thr Ser Thr Leu His Ser Gly Val Pro Ser
123 50 55 60

125 Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Tyr Thr Leu Thr Ile
126 65 70 75

128 Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln Gln
129 80 85 90

131 Gly Asn Thr Leu Pro Pro Thr Phe Gly Gln Gly Thr Lys Val Glu
132 95 100 105

134 Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro
135 110 115 120

137 Ser Asp Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu
138 125 130 135

140 Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val
141 140 145 150

143 Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu
144 155 160 165

146 Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr
147 170 175 180

149 Leu Ser Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu
150 185 190 195

152 Val Thr His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn
153 200 205 210

155 Arg Gly Glu Cys
156 214

158 (2) INFORMATION FOR SEQ ID NO: 3:

160 (i) SEQUENCE CHARACTERISTICS:
161 (A) LENGTH: 36 amino acids
162 (B) TYPE: Amino Acid
163 (D) TOPOLOGY: Linear

165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

167 Leu Gly Gly Arg Met Lys Gln Leu Glu Asp Lys Val Glu Glu Leu
168 1 5 10 15

170 Leu Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala Arg Leu Lys
171 20 25 30

173 Lys Leu Val Gly Glu Arg
174 35 36
176 (2) INFORMATION FOR SEQ ID NO: 4:

178 (i) SEQUENCE CHARACTERISTICS:
179 (A) LENGTH: 7 amino acids
180 (B) TYPE: Amino Acid
181 (D) TOPOLOGY: Linear

183 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

185 Leu Xaa Xaa Xaa Xaa Xaa Xaa
186 1 5

188 (2) INFORMATION FOR SEQ ID NO: 5:

190 (i) SEQUENCE CHARACTERISTICS: 

191 (A) LENGTH: 2143 base pairs 

192 (B) TYPE: Nucleic Acid 

193 (C) STRANDEDNESS: Single

194 (D) TOPOLOGY: Linear

196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

199 TAAT^^AACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50

201 ?SJJgc?gI gSgSattt AAGCTTTGGA GATTATCGTC ACTGCAATGC 100

203 ™AA?AT GGCGCAAAAT GACCAACAGC GGTTGATTGA TCAGGTAGAG 150

205 ^rrrrrCTGT ACGAGGTAAA GCCCGATGCC AGCATTCCTG ACGACGATAC 200

207 gSgSgc?g cg?ga??^g TAAAGAAGTT ATTGAAGCAT CCTCGTCAGT 250

209 S^agS ?S?ttcaac AGCTGTCATA AAGTTGTCAC GGCCGAGACT 300

211 ^^?^gtcgct TTGTTTTTAT TTTTTAATGT ATTTGTAACT AGAATTCGAG 350

213 ScGCCGGGG ATCCTCTAGA GGTTGAGGTG ATTTTATGAA AAAGAATATC 400

215 G^S^TCTTC TTGCATCTAT GTTCGTTTTT TCTATTGCTA CAAACGCGTA 450
TPATCTTCCC GCCATCTGAT GAGCAGTTGA AATCTGGAAC TGCCTCTGTT 850
gSJgccS ?gaataactt CTATCCCAGA GAGGCCAAAG TACAGTGGAA 900 

^i=iil=^i 

SgSSS Lgccggacg catcgtggcg ctagiacgc* »=™»cgt. 0 

s™s s= f™s??r, = 0 

So SgS 1= S 

i i =s =fc^. s= 

™Saa ^Sgataa ATCCACCAGT ACAGCCTACA TGCAAATGAA 1500 



292 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

294 Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe

297 Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser

300 Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr

303 Cys Arg Ala Ser Gln Asp Ile Asn Asn Tyr Leu Asn Trp Tyr Gln

306 Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Tyr Thr Ser
307 40 45 50

309 Thr Leu His Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser
310 55 60 65

312 Gly Thr Asp Tyr Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp
313 70 75 80

315 Phe Ala Thr Tyr Tyr Cys Gln Gln Gly Asn Thr Leu Pro Pro Thr
316 85 90 95

318 Phe Gly Gln Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala
319 100 105 110

321 Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
322 115 120 125

324 Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
325 130 135 140

327 Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly
328 145 150 155

330 Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
331 160 165 170

333 Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
334 175 180 185

336 Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser
337 190 195 200

339 Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
340 205 210 214
342 (2) INFORMATION FOR SEQ ID NO: 7: 

344 (i) SEQUENCE CHARACTERISTICS: 

345 (A) LENGTH: 300 amino acids 
346 (B) TYPE: Amino Acid

347 (D) TOPOLOGY: Linear

349 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

351 Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe

354 Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Glu Ser

_ c 1 



357 Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys

360 Ala Thr Ser Gly Tyr Thr Phe Thr Glu Tyr Thr Met His Trp Met

363 Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Gly Ile Asn

366 Pro Lys Asn Gly Gly Thr Ser His Asn Gln Arg Phe Met Asp Arg

369 Phe Thr Ile Ser Val Asp Lys Ser Thr Ser Thr Ala Tyr Met Gln

372 Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala

375 Arg Trp Arg Gly Leu Asn Tyr Gly Phe Asp Val Arg Tyr Phe Asp

378 Val Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr

379 115 120
VERIFICATION SUMMARY DATE: 10/25/2001

PATENT APPLICATION: US/09/940,166 TIME: 10:19:06

Input set : N:\Crf3\RULE60\09940166.txt
Output set: N:\CRF3\10252001\l940166.raw

L:28 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER:]
L:29 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:]
L:185 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4